Classifying ReachOut posts with a radial basis function SVM

Author

  • Chris Brew
Abstract

The ReachOut clinical psychology shared task addresses the problem of automatically triaging posts to a support forum for people with a history of mental health issues. Posts are classified as green, amber, red or crisis; the non-green categories correspond to increasing levels of urgency for some form of intervention. The Thomson Reuters submissions arose from an idea about self-training and ensemble learning. The available labeled training set is small (947 examples) and its class distribution is unbalanced, so the hope was to develop a method that would make use of the larger set of unlabeled posts provided by the organisers. This did not work, but a radial basis function SVM intended as a baseline performed relatively well. The report therefore focuses on that baseline, aiming to understand the reasons for its performance.
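For readers unfamiliar with the term, the "radial basis function" in an RBF SVM refers to the standard Gaussian kernel, K(x, y) = exp(-gamma * ||x - y||^2). The sketch below only illustrates that textbook definition. The abstract does not state the features or kernel parameters used in the submission, so the toy vectors and the `gamma` value here are purely hypothetical.

```python
import math
from typing import Sequence


def rbf_kernel(x: Sequence[float], y: Sequence[float], gamma: float = 1.0) -> float:
    """Return exp(-gamma * ||x - y||^2), the standard RBF (Gaussian) kernel."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


# Toy word-count vectors (hypothetical, not taken from the ReachOut data).
post_a = [1.0, 0.0, 2.0]
post_b = [1.0, 0.0, 2.0]
post_c = [0.0, 3.0, 0.0]

same = rbf_kernel(post_a, post_b, gamma=0.5)  # identical vectors
diff = rbf_kernel(post_a, post_c, gamma=0.5)  # distant vectors
```

Identical vectors get the maximum similarity of 1, and similarity decays towards 0 as the vectors move apart, with `gamma` controlling how quickly.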


Related articles

Performance of Principal Component Analysis and Orthogonal Least Square on Optimized Feature Set in Classifying Asphyxiated Infant Cry Using Support Vector Machine

An investigation into optimized support vector machine (SVM) integrated with principal component analysis (PCA) and orthogonal least square (OLS) in classifying asphyxiated infant cry was performed in this study. Three approaches were used in the classification: SVM, PCA-SVM, and OLS-SVM. Various numbers of features extracted from M...


Novel Optimization Technique for Classification of Remote Sensing Data Using SVM

Remote sensing data is a collection of images and interpretation of information about an object, area, or event without any physical contact with it. Aircraft and satellites are common remote sensing platforms for earth and its natural sources. Remote sensing’s ability to identify and monitor land surfaces and environmental conditions expanded over years with remote sensed data being essential ...


Classification of Sperm Whale Clicks (Physeter Macrocephalus) with Gaussian-Kernel-Based Networks

With the aim of classifying sperm whales, this report compares two methods that can use Gaussian functions, a radial basis function network, and support vector machines which were trained with two different approaches known as C-SVM and ν-SVM. The methods were tested on data recordings from seven different male sperm whales, six containing single click trains and the seventh containing a comple...


Optimizing Support Vector Machine for Classifying Non Functional Requirements

Problems faced in contemporary practice should be understood to improve requirements engineering processes. System requirements are descriptions of services provided by a system and operational constraints. Non-Functional Requirements (NFRs) define overall qualities/attributes of the system. NFR analysis is a significant activity in this branch of engineering. In this study, a methodology for cl...


Text classification: A least square support vector machine approach

This paper presents a least square support vector machine (LS-SVM) that performs text classification of noisy document titles according to different predetermined categories. The system’s potential is demonstrated with a corpus of 91,229 words from University of Denver’s Penrose Library catalogue. The classification accuracy of the proposed LS-SVM based system is found to be over 99.9%. The fin...



Publication date: 2016